On the Behavior of Certain Metric on the Permutations Group
نویسنده
چکیده
When a new metric is introduced, often there is a "hidden variable" in the similarity relation (frequently depends on the specific area of research), so that we should always speak of similarity with respect to some property, and there is a plethora of measures in part because researchers are often inexplicit on this point. On the other hand, one should have some knowledge about the nature of the problem to be solved. A result that is mathematically sound may be highly implausible and might not reflect what is known about the analyzed process. In information sciences, in computational linguistics or in some automatically categorization problems (especially based on combining rankings) we have to take into account the natural tendency of the objects to place the most important information in the first part of the message. So, if the differences between two objects are at the top (i.e., in essential points), the distance has to have a bigger value then when the differences are at the bottom of the objects. A similar situation can be found in genomics, where the difference on the first positions between two codons is more important than the difference on the last positions (Marcus, 1974). Following the upper motivations, we have introduced the rank distance [Dinu, 2003] as a similarity measure on rankings with linguistics and biological motivations. In some related papers we have investigated the behavior of this metrics in topics like aggregation of classifications, categorization [Dinu 2003], computational linguistics (especially the similarity of Romance languages, [Dinu and Dinu 2005]), the DNA sequence comparison [Dinu and Sgarro 2006], the similarity of trees structures [Dinu 2005]. From a computational point of view, rank distance has a good behavior w.r.t. the so-called "median string problem", i.e. the median string can be computed in a polynomial time by using rank distance [Dinu and Manea, 2006] (we remind that the same problem is NP-hard via other metrics, like Levenshtein, Kendall, etc.) In many cases (computational linguistics, statistics, coding, classification task, etc.), it is enough to use metrics that work on string without repetitions (i.e. rankings or permutations). In this paper we investigate some mathematical properties of the rank distance related on its expected and maximum values on the permutations group. We compute these values and show when they are reached.
منابع مشابه
New best proximity point results in G-metric space
Best approximation results provide an approximate solution to the fixed point equation $Tx=x$, when the non-self mapping $T$ has no fixed point. In particular, a well-known best approximation theorem, due to Fan cite{5}, asserts that if $K$ is a nonempty compact convex subset of a Hausdorff locally convex topological vector space $E$ and $T:Krightarrow E$ is a continuous mapping, then there exi...
متن کاملOn quasi-Einstein Finsler spaces
The notion of quasi-Einstein metric in physics is equivalent to the notion of Ricci soliton in Riemannian spaces. Quasi-Einstein metrics serve also as solution to the Ricci flow equation. Here, the Riemannian metric is replaced by a Hessian matrix derived from a Finsler structure and a quasi-Einstein Finsler metric is defined. In compact case, it is proved that the quasi-Einstein met...
متن کاملA Stream Cipher Based on Chaotic Permutations
In this paper we introduce a word-based stream cipher consisting of a chaotic part operating as a chaotic permutation and a linear part, both of which designed on a finite field. We will show that this system can operate in both synchronized and self-synchronized modes. More specifically, we show that in the self-synchronized mode the stream cipher has a receiver operating as an unknown input o...
متن کاملROBUSTNESS OF THE TRIPLE IMPLICATION INFERENCE METHOD BASED ON THE WEIGHTED LOGIC METRIC
This paper focuses on the robustness problem of full implication triple implication inference method for fuzzy reasoning. First of all, based on strong regular implication, the weighted logic metric for measuring distance between two fuzzy sets is proposed. Besides, under this metric, some robustness results of the triple implication method are obtained, which demonstrates that the triple impli...
متن کاملSharply $(n-2)$-transitive Sets of Permutations
Let $S_n$ be the symmetric group on the set $[n]={1, 2, ldots, n}$. For $gin S_n$ let $fix(g)$ denote the number of fixed points of $g$. A subset $S$ of $S_n$ is called $t$-emph{transitive} if for any two $t$-tuples $(x_1,x_2,ldots,x_t)$ and $(y_1,y_2,ldots ,y_t)$ of distinct elements of $[n]$, there exists $gin S$ such that $x_{i}^g=y_{i}$ for any $1leq ileq t$ and additionally $S$ is called e...
متن کاملComposite Kernel Optimization in Semi-Supervised Metric
Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006